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Abstract— In today’s e-commerce in Nigeria, customers access online stores to browse through and place 
orders for products or services via the internet on their devices while some are skeptical due to the 
experiences from what I ordered versus what I got syndrome. Though this method of business has flourished 
to an extent, it greatly faces a crucial challenge in unravelling consumer’s sentiments particularly in the 
realm of product reviews. This deficiency inhibits most e-commerce platforms in Nigeria from gaining 
effective sensitivity into users’ preferences, thus, limiting their ability to boost their product 
recommendations and, understand and improve customers’ experiences. This research aims to bridge this 
gap by developing a sentiment analyzer of product in the e-commerce domain using Natural language 
processing and machine learning approach. The model will analyze the customers’ reviews based on positive 
or negative. The experimental data was collected from kaggle.com. Stemming and lemmatization were 
approaches used for cleaning the collected data. Features were extracted and transformed using 
CountVectorizer. Gaussian Naive Bayes classifier was used as the machine learning technique. The model’s 
performance was evaluated and it returned 90% of accuracy, hence, an efficient and reliable model for 


product review sentiment analysis is developed. 
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I. INTRODUCTION 


Commerce as the engine that drives the economy of 
societies by promoting business inventions has brought 
about advanced growth in the nation. It has developed from 
the trade-off systems to a more global network of businesses 
from the inception of the internet over the years. This act of 
encompassing commerce by the internet gave birth to 
electronic commerce. Electronic commerce (e-commerce) 
is the buying and selling of goods and services, or the 
transmitting of funds or data over an electronic network, 
primarily the internet (Lutkevich et al., 2023) via devices. 
This transaction occurs either as business-to-business 
(B2B), business-to-consumer (B2C), consumer-to- 
consumer (C2C), consumer-to-business (C2B), business-to- 
administration (B2A), consumer-to-administration (C2A) 
or mobile e-commerce (m-commerce). 


No matter the type of e-commerce a consumer delves into, 
the benefits are easy and fast accessibility, wide visibility of 
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goods and services, and 24/7 hours market service 
availability. Despite these advantages, it is still greatly 
challenged in the areas of cybersecurity, competition, order 
fulfillment, customer’s experience, quality website traffic 
and visitor conversion, visibility, return and refund policies, 
finding the right market, making and increasing sales, 
borderless e-commerce and augmented reality (Post, 2023). 
Amongst these challenges, the most troubling issue is 
customer’s experience. Presently, customers that would like 
to order products online are skeptical in doing such because 
of the experiences many encountered with ordering 
products online. One of such is receiving an item not in the 
form of what was initially shown to the buyer and not 
having the opportunity to lay compliant in that regards. This 
syndrome was tagged with a slogan termed “what I ordered 
versus what I got”. 


In order to find the product-market fit, the reviews of the 
customers are very much needed. It is the first step of any 
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business, and e-commerce is not different. It is an important 
metric that helps a business manager address customer’s 
issue, prevent churn and build a base of loyal customers 
(Needle, 2023). One will learn what the customers like 
about their offers and areas where they can improve, 
thereby, providing a method of facilitating development. 
This development involves personal growth, motivation, 
creativity, morale, job satisfaction, clarity, professional 
relationships, meaningful discussion and goal alignment. It 
is important to note that in absence of a product-market fit, 
some of the challenges such as competition and poor sales 
of products are as a result of the shortcomings of customer 
review. 


In the light of this grave challenge, there is a need to create 
the so much needed avenue that can handle the review of 
customers in order to boost productivity in business. 


I. LITERATURE REVIEW 


Artificial Intelligence (AI) which is sometimes called 
machine intelligence is the intelligence demonstrated by 
machines in contrast to the natural intelligence displayed by 
humans and other animals (Ziyad, 2019). It handles the 
important question of what knowledge is needed in any 
mode of thinking and how should that knowledge be 
applied. The basic components of AI are learning, reasoning 
and decision making, problem solving and perception. 
There are six main branches of AI. They are machine 
learning, deep learning, expert systems, Natural Language 
Processing (NLP), Robotics and Fuzzy logic (Tyagi, 2020). 
Amongst these branches, the one that addresses 
interpretation, comprehension and manipulation from 
human language in order to resolve the issue of customers’ 
review is the NLP. 


Natural Language Processing 


Natural Language Processing is a subset of Artificial 
Intelligence that facilitates seamless communication 
between humans and computers by granting machines the 
capacity to interpret, comprehend, generate, manipulate and 
extract significance from human languages (Oracle, 2022). 
It has become a crucial area of research and development 
that merges interdisciplinary fields in computer science, 
linguistics, and cognitive psychology. Due to the available 
large volume of text and voice data in various forms, such 
as posts from online or social media and news articles, NLP 
covers different tasks such as text analysis, speech 
recognition, language translation, text summarization, and 
sentiment analysis. 


Sentiment Analysis 


Sentiment analysis is literally known as opinion mining. It 
is the process of determining the emotional tone behind a 
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piece of text, whether it is positive, negative, or neutral 
(Biyani 2023). Naturally, companies receive feedbacks 
from countless sources, including customer messages, call 
center and social media posts. These sources are the 
different views of people about the companies’ products. 
Based on the analysis of such sources, the companies gain 
valuable insights in their businesses as they identify areas 
for improvement in terms of products and rendered services, 
thereby, creating and promoting the overall satisfaction for 
their customers. 


In situations where very large volume of textual data are 
collected, the act of manually analyzing these textual data 
is not feasible. In order to effectively analyze such, 
statistical, natural language processing and machine 
learning techniques are applied to determine the emotional 
meaning of communications (Korolov, 2021). Statistical 
methods are used to estimate and evaluate the occurrences 
of positive, negative, or neutral sentiments in a given 
dataset, machine learning algorithms (supervised) train the 
labeled datasets to recognize the available patterns and 
make predictions while NLP techniques enables the system 
to have an extensive acumen of the emotional context 
within the text of the human language by identifying words, 
phrases and linguistic constructs that are sentiment based. 
Sentiment analysis automates the process of analyzing large 
volumes of text data and extracting feelings, emotions and 
attitudes about a product or services generated by depending 
on NLP, thereby, saving time and effort when compared to 
manual analysis. 


Review of Related Works 


Numerous studies have explored the use of sentiment 
analysis for product in e-commerce. Some of them are: 


Alharbi et al. (2021) conducted a research aimed at 
accurately forecasting customer feedback based on 
smartphone reviews gathered from Amazon.com. Deep 
Learning (DL) methods were employed in this study, 
focusing on the analysis and categorization of reviews into 
three classes: positive, neutral, and negative. The authors 
explored various DL approaches including Recurrent 
Neural Networks (RNN) and its four variants—Gated 
Recurrent Unit (GRU), Long Short Term Memory (LSTM), 
Update Recurrent Neural Netwok (UGRNN), and Group 
Long Recurrent Neural Network (GLRNN). These 
algorithms were complemented by word embedding 
techniques, utilizing Glove, word2vec, and FastText as 
feature extraction approaches. 


Norrega et al. (2023) delved into a comprehensive 
investigation into Amazon reviews, employing advanced 
deep learning techniques. Alongside the primary aim of the 
research, a randomly relatable objective was to explore the 
impact of sentiment analysis on customer satisfaction in e- 
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commerce. The models utilized in this study encompassed 
Bidirectional Encoder Representations from Transformers. 
(BERT), Robustly optimized BERT approach (RoBERTa), 
ULMFiT and Extra-Long Network (XLNet), representing a 
diverse set of deep learning frameworks. The research 
unfolded through four distinct stages. The initial stage, Data 
Validation, involved the authors dividing the Amazon 
dataset, sourced from GitHub, into training and testing 
subsets, labeled as 0 or 1 to denote negative or positive 
reviews. Following this, the Statistical Analysis stage 
ensured the avoidance of overfitting, a critical consideration 
to prevent misleading results. Moving forward, the third 
stage involved Exploratory Data Analysis (EDA), where the 
authors employed TF-IDF and K-means clustering to sift 
through the data and eliminate irrelevant words, enhancing 
the precision of the analysis. The culmination of these 
stages led to the Approach stage, where the authors applied 
the proposed models to discern their respective 
performances. Notably, the ROBERTa model emerged as a 
standout performer, achieving an impressive overall 
classification accuracy of 82%. 


Prakash and Aloysius (2021) introduced a noteworthy 
contribution to sentiment analysis on tweets by proposing a 
lexicon-based approach. Their endeavor aimed at tackling 
the pivotal challenges in sentiment analysis, particularly 
focusing on optimizing the performance of lexicon-based 
methodologies. 


Tang et al. (2016). proposed a deep learning-based 
sentiment analysis model for product reviews that 
incorporates attention mechanisms to focus on sentiment- 
bearing words and phrases, improving the accuracy of 
sentiment classification. The authors developed a sentiment 
analysis model based on convolutional neural networks 
(CNNs) with attention mechanisms. CNNs are effective at 
extracting local patterns from text, while attention 
mechanisms allow the model to focus on the most important 
parts of a sentence for sentiment classification. The model 
first extracts local features from product reviews using 
CNNs. Then, an attention mechanism is applied to assign 
weights to these features based on their importance for 
sentiment classification. The final sentiment polarity of the 
review is determined based on the weighted combination of 
the extracted features. 


Vangheese and Nellasivan (2023) proposed a sentiment 
analysis of tweets concerning food delivery services 
employing lexicon-based approaches. The primary 
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objective was to discern consumer perceptions towards two 
prominent food delivery brands, namely Swiggy and 
Zomato. Utilizing the R programming language, the authors 
collected data directly from the two brands’ Twitter feeds 
for analysis. The collected data underwent thorough 
cleaning and pre-processing using various R language 
techniques to ensure data integrity. Subsequently, the 
authors employed a lexicon-based method for emotion 
classification, identifying terms such as "positive," "anger," 
and "joy" to categorize the sentiments expressed in the 
tweets. Post-analysis, it was revealed that the Swiggy brand 
outperformed Zomato, garnering a higher number of 
positive tweets. This outcome underscores the effectiveness 
of the lexicon-based approach in gauging and comparing 
consumer sentiments in the context of food delivery 
services. 


Il. MATERIALS AND METHODS 
Data Collection 


Two separate datasets were collected: the first dataset being 
a labeled dataset from yelp.com that was obtained from 
Kaggle.com a reputable website and an excerpt from 
reviews attained from amazon.com served as the second 
dataset. The two datasets were merged into one dataset 
called “training dataset.csv” for the purpose of this 
research (Fig.1). 


Objectives of the Design 


In order to achieve the development of a product review 
sentiment analyzer, the following objectives were carried 
out: 


a) design a sentiment analyzer using Gaussian Naive 
bayes model. 

b) to preprocess the dataset using lemmatization and 
tokenization methods of NLP. 

c) to extract and transform features using 
CountVectorizer. 

d) design a user-friendly interface that can capture 
both individual and batch sentiments using 
Streamlit. 

e) evaluate the performance of the model using 
accuracy, precision, recall and F1-score. 


Fig. 2 shows the model of the product review sentiment 
analyzer. 
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Jupyter sentiment analysis with gaussian naive bayes 2 Last Checkpoint: 2 days ago 


File Edit 
B + X 


| [85]: 


View Run Kernel 


oO > u. 


Settings Help 


c 


» Code 


import pandas as pd 
import numpy as np 


dataset = pd.read_csv('C:/Users/user/Desktop/training csv.csv') 
dataset2 = 
dataset3 = 


pd.read_csv(*C:/Users/user/Desktop/training_csv_2.csv") 
pd.read_csv('C:/Users/user/Desktop/training_csv_3.csv') 


dataset. review.value_counts 


<bound method IndexOpsitixin.value_counts of @ 


1 Due to a software issue between Nokia and Spri... 
2 This is a great, reliable phone. I also purcha... 
3 I love the phone and all, because I really did... 
4 The phone has been great for every purpose it ... 


Best phone at this price. 


71917 
71918 If you intend to use this phone on T Mobile be... 
71919 Here is my Moto G7 Play complaint: It freezes 
71920 As far as function works great camera no go wo 


71921 What a great phone! Sleek, fast, great soundin... 
Name: review, Length: 71922, dtype: object> 


dataset2.rename(columns={'text': 'review'}, inplace-True) 
dataset3.rename(columns={'text': 'review'}, inplace=True) 
dataset2.head() 


review sentiment 


Fig. 1: Merging of the two datasets obtained from kaggle.com 
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CountVectorization 


Data Splitting 


| 


Performance Evaluation 


Preprocessing 


Model 


Accuracy 
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Gaussian Naives Bayes 


Precision 


Stemming 


l 


Recall 


F1-Score 


Lemmatization 


Classes 


Fig. 2: Product Review Sentiment Analyzer 


Preprocessing 


Data in its unprocessed form might contain some features 
which are not relevant to the research. The first stage of 
preprocessing was cleaning of the dataset whereby the 
columns of the data were renamed for easy interpretation as 
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seen in Fig. 3 and removing duplicate data, handling null 
values, and cleaning non-alphabetic characters, as well as 
converting text to lowercase were implemented to reduce 
bias and inconsistencies which may affect the performance 
of the model. 
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~ Jupyter sentiment analysis with gaussian naive bayes 2 Last Checkpoint: last month 
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dataset2.rename(columns={'text': 'review'}, inplace=True) 
dataset3.rename(columns={'text': 'review'}, inplace=True) 
dataset2.head() 


review sentiment 


0 "Flavor was too overpowering, masking the subt... negative 
1 “Packaging misled on portion size. Much smalle... negative 
2 “Texture was gritty, indicating poor quality i... negative 
3 “Product lacks nutritional information. Leaves... negative 
4 “The expiration date was obscured by the packa... negative 
dataset3.head() 


review sentiment 
0 Wow... Loved this place. positive 


1 Crust isnot good. negative 


Fig.3: Code block of cleaning of the dataset 


The preprocessing techniques such Lemmatization and Tokenization were applied to reduce words to its base or root form and 
convert a sentence into a list of words respectively as shown in Fig. 4. 


< Jupyter sentiment analysis with gaussian naive bayes 2 tast Checkpoint: last month e 

File Edit View Run Kernel Settings Help Trusted 

@+xX OO > = C w Code v jupytertab C3 @& Python 3 (ipykemel O 
corpus a 


for i in range(@,1851): 


= re.sub('[^a-zA-1]' 


review. lower(} 
W = review.split() 
review - [ps. 


em(word) for word in review If not word in set(all_stoprords) 
review = * join(review) 


corpus. append(review) 


corpus 


xtur nasti’, = 


‘not tasti 
op late may bank holiday rick steve recommend love', 
select menu great price’, 


“get angri want damn pho’, 

*honeslti tast fresh’, 

“potato like rubber could tell made ahead time kept warmer’, 
‘fri great’, 
“great touch’, 

‘servic prompt’, 

d not go back’, 

ier care ever say still end wayyy overpr’, 


tri cape cod ravoli chicken cranberri mmmn', 


‘disgust pretti sure human hair’, 
“shock sign indic cash", 


'hiobhli marnmmond* 


Fig. 4: Code Block for Data Preprocessing 


Data Splitting and testing sets in the ratio of 80%:20% of data for each 


For effective model evaluation, a randomized split with a respectively (Fig. 5). This approach ensures reproducibility. 


fixed random state was used to split the dataset into training 
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File Edit View Run Kernel Settings Help Trusted 
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sdi 7 dae = 

from sklearn.model_selection import train_test_split 

X_train, X_test, y_train, y_test = train_test_split(x, y, test_size = 6.26, random_state = 42) 


£ S import GaussianNe 

= GaussianN8g( 

«fit(x_train, y_train 
GaussianNB 

|GaussianNB() 

import joblib 


joblib.dump(classifier, 'C:/Users/user/Desktop/montana_ NB 


['C:/Users/user/Desktop/montana_NB_classifier Model 2°] 


Fig. 5: Data Splitting Code Block 


(Fig. 6). This transformation facilitated machine learning 
model comprehension of textual data. It allowed extraction 
of meaningful patterns from text, aligning with the 
numerical nature of machine learning models and captured 
word frequencies for sentiment determination. Figs. 7a and 
b show word cloud for positive and negative reviews. 


Feature Extraction 


CountVectorizer was employed to convert raw text into a 
numerical format suitable for model’s training. This 
technique created a matrix of token counts, where each row 
represented a document and each column a unique word 


m~ s 3 a 7 
oe J pyter sentiment analysis with gaussian naive bayes 2 tast Checkpoint: last month dl 
File Edit View Run Kernel Settings Help Trusted 
@+x @ © >» e C h Code telab © beds 
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Fig.6: Feature Extraction using CountVectorizer 
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Fig. 7a: WorldCloud for Positive Reviews 
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WordCloud for Negative Reviews 


Fig. 7b: WorldCloud for Negative Reviews 


Model Selection 


The Gaussian Naive Bayes (GNB) was adopted. The choice 
was motivated by its suitability for handling numerical 
features which aligns with the transformed data obtained 
from features extractions using techniques like 
CountVectorizer and particularly effective in scenarios 
where features follow a Gaussian distribution. 


Training 

The model was trained using the training dataset. The 
process involved exposing the model to the labeled dataset, 
thereby, allowing it to learn patterns and relationships 
between the numerical features and sentiment labels 


(positive and negative). For the testing, the testing dataset 
was used to test the model. 


Performance Evaluation 


The model was evaluated using accuracy, precision, recall, 
and Fl-score as the evaluation metrics to gauge the model's 
effectiveness in classifying sentiments. 


Precision: This is used to determine the exactness of the 
measurement. It measures the accuracy of the positive 
predictions. It is calculated as: 


Choose an option 


Single Text Analysis 


True Positives 


Precision = 7 7 
(True Positives+False Positives) 


(1) 

Recall: This measures the completeness of positive 
predictions, that is, measure of how well a model correctly 
identifies True Positives. 


True Positives 
Recall = 


True Positives+False Positives 
(2) 
Fl-Score: A measure of a model’s accuracy on a dataset. It 
is a harmonic means of both precision and recall of the 
model. It is determined as: 


(precision x recall) 


F1score = 2X 


(precision+recall) 


(3) 


IV. RESULTS AND DISCUSSION 


For the user interface, Fig. 8 shows the user interface 
featuring the analysis of a customer’s review. The user 
interface accepts both individual and batch sentiments 
depending on the input at hand. 


Deploy 


Sentiment Analysis App 


Single Text Analysis 


Enter a text for analysis 


I think the new ingredient is bad and should not have been added to this chips 


Sentiment: negative 


Fig. 8: A User-friendly interface that captures both individual and batch sentiments 
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For batch analysis, Fig. 9 shows the sample of customers’ analyzed sentiments. 
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“Product lacks n nal information. Le consumers in the dark about content! negative 


Fig. 9: Sample of analyzed sentiments 


The evaluation of the performance of the model returned distribution and proportion of the positive and negative 
90% as accuracy (Fig. 10). This shows the effectiveness and reviews are presented on Figs. 11 and 12 respectively. The 
reliability of the model in analyzing sentiments of accuracy of the model returned 90% to show its efficiency. 
customers for the products received. The results of the 
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Fig. 10: Performance Evaluation of the model 


Eo Jupyter sentiment analysis with gaussian naive bayes 2 Last Checkpoint: 2 days ago 
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plt.title('Distribution of Negative and Positive Reviews") 
plt.show() 


Distribution of Negative and Positive Reviews 


count 


positive negative 
sentiment 


Fig. 11: Bar Chart showing the Distribution of Negative and Positive Reviews 


This article can be downloaded from here: www.ijaems.com 40 
©2024 The Author(s). Published by Infogain Publication, This work is licensed under a Creative Commons Attribution 4.0 License. 


http://creativecommons.org/licenses/by/4.0 


Nkemdilim and Michael 


International Journal of Advanced Engineering, Management and Science, 10(3) -2024 


= Jupyter sentiment analysis with gaussian naive bayes 2 Last Checkpoint: 2 days ago 


File Edit View Run Kernel Settings Help 
a+xX 0 O > m C h Code v 


JupyterLab (7 2 


Proportion of Negative and Positive Reviews 


positive 


count 


negative 


Fig. 12: Pie Chart of the Proportion of Negative and Positive Reviews 


V: CONCLUSION 


The research shows the feasibility and effectiveness of 
utilizing Gaussian Naive Bayes in sentiment analysis for e- 
commerce product review. The user interface enhances 
accessibility, enabling users to analyze sentiments for both 
individual and across large datasets. The outcomes of this 
research contributed to the advancement of sentiment 
analysis techniques in the context of e-commerce, thereby, 
providing valuable insights into customer preferences and 
sentiments. 
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